AITopics | bilevel problem

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Neural Information Processing SystemsJun-16-2026, 00:07:29 GMT

In this paper, we study the problem of solving a simple bilevel optimization problem, where the upper-level objective is minimized over the solution set of the lower-level problem. We focus on the general setting in which both the upper-and lower-level objectives are smooth but potentially nonconvex. Due to the absence of additional structural assumptions for the lower-level objective--such as convexity or the Polyak-Łojasiewicz (PL) condition--guaranteeing global optimality is generally intractable. Instead, we introduce a suitable notion of stationarity for this class of problems and aim to design a first-order algorithm that finds such stationary points in polynomial time. Intuitively, stationarity in this setting means the upper-level objective cannot be substantially improved locally without causing a larger deterioration in the lower-level objective. To this end, we show that a simple and implementable variant of the dynamic barrier gradient descent (DBGD) framework can effectively solve the considered nonconvex simple bilevel problems up to stationarity.

artificial intelligence, bilevel problem, machine learning, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Add feedback

Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem

Neural Information Processing SystemsApr-25-2026, 03:10:13 GMT

In this paper, we study a class of stochastic bilevel optimization problems, also known as stochastic simple bilevel optimization, where we minimize a smooth stochastic objective function over the optimal solution set of another stochastic convex optimization problem. We introduce novel stochastic bilevel optimization methods that locally approximate the solution set of the lower-level problem via a stochastic cutting plane, and then run a conditional gradient update with variance reduction techniques to control the error induced by using stochastic gradients. For the case that the upper-level function is convex, our method requires O(max{1/ϵ2f,1/ϵ2g}) stochastic oracle queries to obtain a solution that is ϵfoptimal for the upper-level and ϵg-optimal for the lower-level. This guarantee improves the previous best-known complexity of O(max{1/ϵ4f,1/ϵ4g}). Moreover, for the case that the upper-level function is non-convex, our method requires at most O(max{1/ϵ3f,1/ϵ3g})stochastic oracle queries to find an (ϵf,ϵg)-stationary point. In the finite-sum setting, we show that the number of stochastic oracle calls required by our method are O( n/ϵ) and O( n/ϵ2) for the convex and non-convex settings, respectively, where ϵ = min{ϵf,ϵg}.

artificial intelligence, machine learning, probability 1, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.86)

Add feedback

0aa800df4298539770b57824afc77a89-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 15:17:30 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry:

Education (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

0aa800df4298539770b57824afc77a89-Paper-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 15:17:26 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Neur2BiLO: Neural Bilevel Optimization

Neural Information Processing SystemsMar-21-2026, 19:39:40 GMT

Bilevel optimization deals with nested problems in which takes the first decision to minimize their objective function while accounting for a's best-response reaction. Constrained bilevel problems with integer variables are particularly notorious for their hardness. While exact solvers have been proposed for mixed-integer bilevel optimization, they tend to scale poorly with problem size and are hard to generalize to the non-linear case. On the other hand, problem-specific algorithms (exact and heuristic) are limited in scope. Under a data-driven setting in which similar instances of a bilevel problem are solved routinely, our proposed framework, Neur2BiLO, embeds a neural network approximation of the leader's or follower's value function, trained via supervised regression, into an easy-to-solve mixed-integer program. Neur2BiLO serves as a heuristic that produces high-quality solutions extremely fast for four applications with linear and non-linear objectives and pure and mixed-integer variables.

artificial intelligence, machine learning, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

d4dd111a4fd973394238aca5c05bebe3-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 08:16:38 GMT

Under the new analysis, to achieve an -stationary point of the nested problem, it requiresO( 2)samples in total.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
Asia > Middle East > Jordan (0.04)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)

Add feedback

Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem

Neural Information Processing SystemsFeb-8-2026, 02:55:14 GMT

However, all the above results are limited to the deterministic setting.

artificial intelligence, machine learning, probability 1, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Arizona (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Add feedback

0aa800df4298539770b57824afc77a89-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 16:24:09 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.46)

Industry:

Education (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Resolving the Tug-of-War: A Separation of Communication and Learning in Federated Learning

Neural Information Processing SystemsFeb-7-2026, 16:24:04 GMT

Therefore, we perform the learning and communication over the same set of parameters.

artificial intelligence, arxiv preprint arxiv, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland > Prince George's County > College Park (0.04)

Genre: Research Report (0.46)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition

Neural Information Processing SystemsDec-26-2025, 18:35:44 GMT

Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly convex lower-level objective. However, it remains unclear whether this result can be generalized to bilevel problems beyond this basic setting. In this paper, we first introduce a stationary metric for the considered bilevel problems, which generalizes the existing metric, for a nonconvex lower-level objective that satisfies the Polyak-Łojasiewicz (PL) condition. We then propose a Generalized ALternating mEthod for bilevel opTimization (GALET) tailored to BLO with convex PL LL problem and establish that GALET achieves an $\epsilon$-stationary point for the considered problem within $\tilde{\cal O}(\epsilon^{-1})$ iterations, which matches the iteration complexity of GD for single-level smooth nonconvex problems.

alternating optimization method, bilevel problem, name change, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

bilevel problem

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem

0aa800df4298539770b57824afc77a89-Supplemental-Conference.pdf

0aa800df4298539770b57824afc77a89-Paper-Conference.pdf

Neur2BiLO: Neural Bilevel Optimization

d4dd111a4fd973394238aca5c05bebe3-Paper.pdf

Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem

0aa800df4298539770b57824afc77a89-Supplemental-Conference.pdf

Resolving the Tug-of-War: A Separation of Communication and Learning in Federated Learning

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition